Words containing , (comma):
Rank in Wordlist | Frequency | Word |
---|---|---|
3024 | 56 | 1,5 |
3984 | 43 | 2,5 |
6169 | 28 | 3,5 |
7394 | 23 | 1,2 |
7395 | 23 | 1,8 |
7698 | 22 | 1,3 |
9235 | 18 | 4,5 |
12326 | 13 | 5,5 |
13245 | 12 | 1,6 |
14303 | 11 | 0,5 |
Words containing %:
Rank in Wordlist | Frequency | Word |
---|---|---|
2653 | 63 | 20% |
2967 | 57 | 30% |
3361 | 51 | 50% |
3404 | 50 | 10% |
3561 | 48 | 5% |
4287 | 40 | 15% |
4289 | 40 | 70% |
4770 | 36 | 25% |
5061 | 34 | 100% |
5062 | 34 | 40% |
Words containing &:
Rank in Wordlist | Frequency | Word |
---|---|---|
32369 | 4 | S&P |
52438 | 2 | R&B |
80680 | 1 | AK&M |
80681 | 1 | AK&M-List |
80821 | 1 | Arnold&Son |
81012 | 1 | Blue&MeTM |
81458 | 1 | E&I |
82444 | 1 | M&M’s |
83376 | 1 | Standard&Poor’s |
83444 | 1 | T&A |
Words containing $:
Rank in Wordlist | Frequency | Word |
---|---|---|
62127 | 2 | грн/$1 |
77727 | 1 | 147,11/$1 |
77728 | 1 | 147,34/$1 |
79137 | 1 | 338,73/$1 |
80418 | 1 | 85,4/$1 |
83609 | 1 | US$0,704 |
83610 | 1 | US$21.16/унц |
83611 | 1 | US$22.28 |
114432 | 1 | Т138,93/$1 |
114433 | 1 | Т140,50/$1 |
Words containing " (double quote):
Rank in Wordlist | Frequency | Word |
---|---|---|
7111 | 24 | ." |
Words containing ' (apostrophe):
Rank in Wordlist | Frequency | Word |
---|---|---|
39739 | 3 | Sotheby's |
52178 | 2 | Christie's |
52284 | 2 | Harper's |
77147 | 1 | 1'st |
80848 | 1 | Australia's |
81092 | 1 | C'Space |
81093 | 1 | C'est |
81428 | 1 | Don't |
81861 | 1 | Guns N' Roses |
82017 | 1 | ILA'2004 |
Words containing +:
Rank in Wordlist | Frequency | Word |
---|---|---|
32091 | 4 | 1+1 |
77347 | 1 | 105+132 |
77386 | 1 | 11+1 |
77989 | 1 | 18+Киев |
78148 | 1 | 193+240 |
78303 | 1 | 2+2 |
78304 | 1 | 2+4 |
79868 | 1 | 6+2 |
80876 | 1 | B+C1+C2 |
81297 | 1 | Ctrl+Shift+Esc |
Words containing /:
Rank in Wordlist | Frequency | Word |
---|---|---|
11131 | 15 | м/с |
12452 | 13 | Синьхуа/ |
14307 | 11 | 1/8 |
18663 | 9 | ц/га |
18722 | 8 | 1/4 |
19445 | 8 | грн/л |
21487 | 7 | б/у |
21734 | 7 | грн./ |
22060 | 7 | км/час |
23779 | 6 | Trend/ |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others will not. If words containing forbidden characters do not have a very low frequency, there may be a problem in preprocessing.
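The check described above can be sketched in a few lines of Python. This is only an illustration: the function name `suspicious_words`, the choice of forbidden characters, and the threshold `min_freq=10` are assumptions made for the example, not values taken from this report. The sample rows reuse entries from the tables in this section.

```python
from collections import defaultdict


def suspicious_words(wordlist, forbidden, min_freq=10):
    """Group words that contain a forbidden character and occur at least min_freq times.

    wordlist  -- iterable of (word, frequency) pairs
    forbidden -- characters assumed not to be allowed inside words
    min_freq  -- hypothetical threshold for "not very low frequency"
    """
    hits = defaultdict(list)
    for word, freq in wordlist:
        if freq < min_freq:
            continue  # rare words are treated as harmless noise
        for ch in sorted(set(word) & set(forbidden)):
            hits[ch].append((word, freq))
    return dict(hits)


# A few (word, frequency) pairs copied from the tables in this section.
sample = [("1,5", 56), ("20%", 63), ("S&P", 4), ("Sotheby's", 3), ("м/с", 15)]

# Assumption for the example: & ' and / are forbidden in this language.
result = suspicious_words(sample, forbidden={"&", "'", "/"}, min_freq=10)
print(result)  # {'/': [('м/с', 15)]}
```

Which characters count as forbidden, and how low "very low" is, depends on the language and the corpus size, so both are left as parameters.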
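Words containing % (the backslash escapes the literal %, because % is also the LIKE wildcard; w_id-100 is reported as the rank in the wordlist):
select w_id-100, freq, word from words where w_id>100 and word like "%\%%" limit 10;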
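The same query can be repeated for each of the other characters. The sketch below shows one way to do that, using Python's `sqlite3` module with an in-memory database rather than the original database server. The helper names `like_pattern` and `top_words_containing` are hypothetical, the table layout `words(w_id, word, freq)` is inferred from the query above, and the `ORDER BY w_id` clause is an addition to make the output deterministic. The point it illustrates is that `%` and `_` are LIKE wildcards and must be escaped, while the other characters can be used as they are.

```python
import sqlite3

# The characters listed in this subsection.
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(ch):
    """Return a LIKE pattern that matches any word containing ch literally."""
    # % and _ are LIKE wildcards; the escape character itself must be escaped too.
    if ch in ("%", "_", "\\"):
        ch = "\\" + ch
    return "%" + ch + "%"


def top_words_containing(conn, ch, limit=10):
    """Mirror the query above: skip the first 100 ids and report w_id - 100 as rank."""
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"  # ORDER BY is an assumption, not in the original query
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()


# Toy database: the first three rows reuse values from the tables in this
# section, the last two are made-up rows for the demonstration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (2753, "20%", 63),
        (3124, "1,5", 56),
        (32469, "S&P", 4),
        (50000, "under_score", 2),
        (60000, "plain", 2),
    ],
)

for ch in SPECIAL_CHARS:
    rows = top_words_containing(conn, ch)
    if rows:
        print(repr(ch), rows)
```

Without the escaping step, the pattern for `_` would match every word of at least one character, and the pattern for `%` would match every word.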